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SYSTEM, METHOD AMD SOFTWARE FOR FASTER 
PARITY BASED RAID CREATION 

TECHNICAL FIELD 

The present invention relates generally to 
information handling systems and, more particularly, to 
creating storage management solutions on information 
storage devices . 
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BACKGROUND 

As the value and use of information continues to 
increase, individuals and businesses seek additional ways 
to process and store information. One option available 
5 to users is information handling systems. An information 
handling system generally processes, compiles, stores, 
and/or communicates information or data for business, 
personal, or other purposes thereby allowing users to 
take advantage of the value of the information. Because 

10 technology and information handling needs and 
requirements vary between different users or 
applications, information handling systems may also vary 
regarding what information is handled, how the 
information is handled, how much information is 

15 processed, stored, or communicated, and how quickly and 
efficiently the information may be processed, stored, or 
communicated. The variations in information handling 
systems allow for information handling systems to be 
general or configured for a specific user or specific use 

20 such as financial transaction processing, airline 
reservations, enterprise data storage, or global 
communications. In addition, information handling 
systems may include a variety of hardware and software 
components that may be configured to process, store, and 

25 communicate information and may include one or more 

computer systems, data storage systems, and networking 
systems . 

Network attached storage (NAS) systems is a member 
of the information handling system family. Low- end NAS 
3 0 systems typically support integrated Advanced Technology 

AUSOl: 324714 .1 



ATTORNEY'S DOCKET 
016295.1469 
(DC-05355) 



PATENT APPLICATION 



4 

Attachment (ATA) disk drives and commonly use software 
redundant arrays of inexpensive disks (RAID) based on an 
information handling system operating system volume 
manager to provide RAID- 5 redundancy for its data 
5 volumes . 

The process of creating a RAID- 5 volume consists 
generally of two processes. First, the volume manager 
structures or configuration information on the disk 
drives is updated. Next, the RAID-5 is typically 

10 initialized so that it is consistent, i.e., parity is 
calculated for the stripes and written. 

The creation of a RAID-5 data volume is typically 
expensive in terms of time. The time required for 
building a RAID-5 generally means that it takes the 

15 factory a significant amount of time to build a low-end 
network attached storage system. The time it takes to 
build a software RAID-5 is enormous in comparison to the 
time it takes to build a RAID-5 system based on hardware 
RAID or, to build no RAID at all. As a result of the 

20 increase in time spent in the factory build process, the 
cost of manufacturing a low end network attached storage 
system is significantly increased. 

In some factories, build times are on the order of 
three to four hours with one -hundred- twenty (12 0) 

25 gigabyte hard drive devices with drive sizes due to 

increase, these build times will get progressively worse. 
The next drive size expected for implementation in low 
end network attached storage systems is likely to be on 
the order of two-hundred-f if ty (250) gigabytes. Such a 

30 drive size will approximately double the time it takes to 
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build a RAID- 5 low end network attached storage (NAS) 
system. 

Experimentation shows that most of the time spent in 
building a RAID-5 data volume is spent in the 
5 initialization process. In particular, the read and 
write operations and calculation of parity for the 
initialization of the RAID-5 build process consume the 
vast majority of time. 
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SUMMARY 

In accordance with teachings of the present 
disclosure, a system, method and software for enabling 
faster parity based RAID creation, such as RAID-3 and 
5 RAID- 5 are provided. 

In one aspect, teachings of the present disclosure 
provide a method for volume manager based redundant array 
of inexpensive disk creation. The method preferably 
includes obtaining information on a data portion of a 

10 disk RAID volume, verifying that disks of the RAID volume 
are zeroed and aborting RAID creation if the disks are 
not zeroed. Further, the method preferably also provides 
monitoring input /output (I/O) operations between an 
information handling system volume manager and an 

15 information handling system disk driver, as well as 
intercepting all I/O operations between the volume 
manager and the disk driver. If an intercepted I/O 
operation is a write operation directed to the data 
portion of the RAID volume, the method preferably 

2 0 provides for returning a success status to a requesting 
application. If an intercepted I/O operation is a read 
operation of the data portion of the RAID volume, the 
method preferably provides for returning a zeroed buffer 
to a requesting application. In addition, if an 

25 intercepted I/O operation is a non-data portion access 

I/O operation, the method preferably provides for passing 
the non-data portion access I/O operation to the disk 
driver for processing. 

In another aspect, teachings of the present 

30 invention provide an information handling system 
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including at least one processor, a memory and at least 
three information storage devices operably coupled to the 
processor. In addition, a program of instructions 
storable in the memory and executable by the processor is 
5 included and is preferably operable to intercept 

input /output (I/O) operations during creation of a RAID 
on the information storage devices. In addition, the 
program of instructions is preferably further operable to 
process I/O operations directed to accessing RAID disk 

10 structures, process I/O operations directed to accessing 
RAID configuration information as well as provide for the 
processing of I/O operations directed to accessing a data 
portion of the RAID. 

In a further aspect, teachings of the present 

15 disclosure provide a computer readable medium including a 
program of instructions, the program of instructions 
implementing a method for RAID creation. The program of 
instructions is preferably operable to cause an 
information handling system to monitor I/O operations 

2 0 submitted for processing by an information handling 
system disk driver and to filter I/O operations 
associated with a data portion of the RAID. 

A technical advantage provided by teachings of the 
present disclosure includes enabling lower cost network 

25 attached storage systems to be developed through faster 
volume manager based RAID creation. 

In another aspect, teachings of the present 
disclosure provide the technical advantage of enabling 
all systems employing volume manager RAID to reduce RAID 

30 creation times. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

A more complete understanding of the present 
embodiments and advantages thereof may be acquired by 
referring to the following description taken in 
5 conjunction with the accompanying drawings, in which like 
reference numbers indicate like features, and wherein: 

FIGURE 1 is a block diagram depicting an exemplary 
embodiment of an information handling system according to 
teachings of the present disclosure; 
10 FIGURE 2 illustrates a block diagram depicting an 

exemplary embodiment of a driver stack according to 
teachings of the present disclosure; 

FIGURE 3 is a flow diagram depicting an exemplary 
embodiment of filter driver initialization according 
15 teachings of the present disclosure; 

FIGURE 4 is a flow diagram depicting an exemplary 
embodiment of filter driver operation according to 
teachings of the present disclosure; and 

FIGURE 5 is a flow diagram depicting an exemplary 
2 0 embodiment of filter driver use in a factory RAID build 

process according to teachings of the present disclosure. 
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DETAILED DESCRIPTION 

Preferred embodiments and their advantages are best 
understood by reference to FIGURES 1 through 5, wherein 
like numbers are used to indicate like and corresponding 
5 parts . 

For purposes of this disclosure, an information 
handling system may include any instrumentality or 
aggregate of instrumentalities operable to compute, 
classify, process, transmit, receive, retrieve, 

10 originate, switch, store, display, manifest, detect, 
record, reproduce, handle, or utilize any form of 
information, intelligence, or data for business, 
scientific, control, or other purposes. For example, an 
information handling system may be a personal computer, a 

15 network storage device, or any other suitable device and 
may vary in size, shape, performance, functionality, and 
price. The information handling system may include 
random access memory (RAM) , one or more processing 
resources such as a central processing unit (CPU) or 

2 0 hardware or software control logic, ROM, and/or other 
types of nonvolatile memory. Additional components of 
the information handling system may include one or more 
disk drives, one or more network ports for communicating 
with external devices as well as various input and output 

2 5 (I/O) devices, such as a keyboard, a mouse, and a video 

display. The information handling system may also 
include one or more buses operable to transmit 
communications between the various hardware components. 
Referring first to FIGURE 1, a block diagram of an 

3 0 information handling system, such as a network attached 
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Storage (NAS) appliance, is shown, according to teachings 
of the present disclosure. Information handling system 
10 preferably includes at least one microprocessor or 
central processing unit (CPU) 12 . CPU 12 may include 
5 processor 14 for handling integer operations and 

coprocessor 16 for handling floating point operations. 
CPU 12 is preferably coupled to cache 18 and memory 
controller 20 via CPU bus 22. System controller I/O trap 
24 preferably couples CPU bus 22 to local bus 2 6 and may 

10 be generally characterized as part of a system 

controller. Main memory 2 8 of dynamic random access 
memory (DRAM) modules is preferably coupled to CPU bus 22 
by a memory controller 20. 

Basic input /output system (BIOS) memory 3 0 is also 

15 preferably coupled to local bus 26. FLASH memory or other 
nonvolatile memory may be used as BIOS memory 30. A BIOS 
program (not expressly shown) is typically stored in BIOS 
memory 30. The BIOS program preferably includes software 
which facilitates initialization of information handling 

20 system 10 devices such as a keyboard (not expressly 

shown) , a mouse (not expressly shown) , or other devices 
as well as aids in the initial loading of the operating 
system. 

Bus interface controller or expansion bus controller 
25 32 preferably couples local bus 26 to expansion bus 34. 
Expansion bus 34 may be configured as an Industry 
Standard Architecture ("ISA") bus or a Peripheral 
Component Interconnect ( " PCI " ) bus . Other NAS 
deployments may include alternative expansion bus 
30 technologies. 
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Interrupt request generator 36 is also preferably 
coupled to expansion bus 34. Interrupt request generator 
3 6 is preferably operable to issue an interrupt service 
request over a predetermined interrupt request line in 
5 response to receipt of a request to issue interrupt 
instruction from CPU 12 . 

I/O controller 38 is also preferably coupled to 
expansion bus 34. I/O controller 38 preferably 
interfaces to Advanced Technology Attachment (ATA) hard 

10 drives 40, 42, 44 and 46. While reference herein is made 
to ATA hard drive devices, it should understood that 
teachings of the present disclosure may also be 
implemented with other hard drive device or storage 
technologies including, but not limited to. Serial 

15 Advanced Technology Attachment (SATA) devices. Small 
Computer Systems Interface (SCSI) devices, and Fiber 
Channel devices. 

Network interface controller 48 is preferably 
provided and enables information handling system 10 to 

20 communicate with communication network 50, e.g., an 

Ethernet network. Communication network 50 may include a 
local area network ("'LAN"), wide area network ('"WAN"), 
Internet, Intranet, wireless broadband or the like. 
Network interface controller 48 preferably forms a 

25 network interface for communicating with other 

information handling systems (not expressly shown) 
coupled to communication network 50. An information 
handling system' s communication components generally 
include hardware as well as software components. 

30 Examples of hardware components include network interface 
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controller 48 and communication network 50. Examples of 
software components specific to NAS may include file 
server services and network administration services. 

Real-time clock (RTC) 64 may also be coupled to I/O 
5 controller 38. Real-time clock 64 may be programmed to 
generate an alarm signal at a predetermined time as well 
as to perform other operations. 

In general, teachings of the present disclosure 
describe a mechanism by which RAID volume configuration 

10 information can be written to storage disks without 

requiring the initialization of all the blocks in each 
RAID volume. In one embodiment, teachings of the present 
disclosure provide for using a filter between an 
information handling system 10 operating system disk 

15 driver and volume manager driver. Once enabled, the 
filter driver will preferably intercept all I/O 
transactions between the volume manager and the disk 
driver. In operation, the filter driver will preferably 
allow only those I/O transactions that access the disk 

2 0 structures and configuration information of a RAID volume 

to go through. All other I/O transactions would 
generally be handled by the filter driver. Any write 
operations directed to the data portion of the RAID 
intercepted by the filter driver, will preferably be 
25 returned to the write requesting application with a 
"good" or "success" status. Any intercepted read 
operations will preferably be returned with a zero 
buffer, which is typically the correct parity for zeroed 
out disk drives, to the read requesting application. One 

3 0 advantage of teachings of the present disclosure is that 
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a RAID creation implementation in accordance with 
teachings of the present disclosure generally handles the 
case where the volume manager either writes data to all 
the disk drives or uses read-modify-write to initialize a 
5 RAID- 5 configuration. 

In general, a RAID- 5 data storage configuration 
involves a number of independent data disks with 
distributed parity blocks. In a typical RAID- 5 
implementation, each data block is written on a data 

10 disk. The parity for blocks in the same ranks is 

generally generated on writes, recorded in a distributed 
location and checked on reads. At a minimum, RAID level 
5 requires a minimum of three drives to implement. 

Referring now to FIGURE 2, a block diagram depicting 

15 a driver stack incorporating teachings of the present 

disclosure is shown generally at 88. As mentioned above, 
one implementation of teachings of the present disclosure 
enables a program of instructions, storable on a 
computer- readable medium, to be implemented on 

20 information handling system 10 and to perform the various 
preferred operations discussed herein. In one 
embodiment, a filter driver incorporating teachings of 
the present disclosure may be implemented alongside one 
or more drivers included in an operating system running 

25 on information handling system 10. As depicted generally 
at 88, an information handling system 10 having an 
operating system running thereon and incorporating a 
filter driver as taught by the present disclosure, may 
include a file system driver 90, a volume manager driver 

30 92, a filter driver 94 according to teachings of the 
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present disclosure, a disk driver 96 and an Advanced 
Technology Attachment (ATA) driver 98. Alternative 
driver stack arrangements are contemplated within the 
spirit, scope, and teachings of the present disclosure. 
5 In a conventional operating system, read, write and 

other operations are typically passed from one or more 
applications running on information handling system 10 to 
file system driver 90. Using intelligence and operations 
incorporated in file system driver 90, the operations 

10 received there may be reviewed and sorted as necessary. 
Selected ones of the operations received by file system 
driver 90 may be passed to volume manager driver 92 . In 
a conventional operating system, the intelligence and 
operations incorporated in volume manager driver 92 may 

15 be applied to operations received thereby and 

subsequently passed to disk driver 96. Applying the 
intelligence and operations incorporated in disk driver 
96, operations received thereby may then be passed to ATA 
driver 98 for processing in accordance with its 

20 intelligence and operations. 

As shown in FIGURE 2, generally at 88, one 
implementation of teachings of the present disclosure 
involves enabling a filter driver incorporating teachings 
of the present disclosure between volume manager driver 

25 92 and disk driver 96. While discussion herein may refer 
primarily to incorporation of filter driver 94 between 
volume manager driver 92 and disk driver 96 of driver 
stack 88, other implementations are considered within the 
spirit and scope of the present disclosure. 
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Referring now to FIGURE 3, a flow diagram depicting 
one embodiment of the initialization of filter driver 94 
is shown generally at 100. Upon initialization of method 
100 at 102, method 100 preferably proceeds to 104 where 
5 information regarding a data portion of the RAID volume 
on disk may be obtained, e.g., obtaining the beginning 
and ending addresses of the RAID volume's data portion, 
the size of the data portion, etc. In general, the 
actual process of determining the extents of the data 

10 portion of a RAID on a disk, as opposed from RAID 

configuration information, depends on the implementation 
of the operating system's volume manager driver 92. In 
one aspect, this information may be read from the disks. 
In an alternate aspect, this information may be obtained 

15 by using one or more operating system and/or volume 
manager application program interfaces (API) . Once 
desired information regarding the data portion of the 
RAID volume on disk has been obtained at 104, method 100 
preferably proceeds to 106. 

20 At 106 of method 100, HDD 40, 42, 44 and 46 of 

information handling system 10 are preferably read to 
determine their state. In particular, at 106 of method 
100, HDD 40, 42, 44 and 46 are preferably reviewed to 
determine whether the information storage devices have 

25 been zeroed out. A number of methodologies may be used 
to determine if HDD 40, 42, 44 and 46 are zeroed out. 
For example, one method to determine whether HDD 40, 42, 
44 and 46 are zeroed out is to do sample reads within the 
full range of the disk logical block addresses (LBA) and 

30 to verify that the data read back is all zeroes. In such 
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a methodology, the greater the number of samples, the 
greater the probability that the disks are in fact fully 
zeroed out . 

At 108, an evaluation or review of the data acquired 
5 at 106 is preferably made. If at 108 it is determined 
that the disk drives of information handling system 10 
are in fact zeroed out, method 100 preferably proceeds to 
110 where filter driver 94 of the present disclosure may 
begin monitoring I/O operations between a volume manager 

10 driver and a disk driver of an information handling 

system operating system, such as volume manager driver 92 
and disk driver 96. Once monitoring has been initiated 
at 110, method 100 may proceed to 112. 

Alternatively, if at 108 it is determined that the 

15 information obtained at 106 shows that the disk drives of 
information handling system 10 are not in a zeroed out 
state, method 100 preferably proceeds to 114 where the 
filter driver of the present disclosure is preferably 
disabled. If the filter driver 94 is disabled at 114, 

20 method 100 preferably ends at 112, As such, in one 

embodiment, the disk drives of an information handling 
system in which a filter driver incorporating teachings 
of the present disclosure is to operate may be required 
to be in a zeroed out state. An alternate implementation 

25 of teachings of the present disclosure may incorporate 

one or more of a plurality of methods for zeroing out the 
disk drives of a selected information handling system. 

Referring now to FIGURE 4, a flow diagram depicting 
one embodiment of filter driver 94 operation is shown 

30 generally at 116. Upon initialization at 118, such as 
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through fulfilling step 110 of method 100 illustrated in 
FIGURE 3, method 116 preferably proceeds to 120. At 120, 
preferably all I/O operations between volume manager 92 
and disk driver 96 of driver stack 88 are intercepted by 
5 filter driver 94. Upon interception of an I/O operation 
at 120, method 116 preferably proceeds to 122. 

At 122, each I/O operation is preferably evaluated 
to determine whether the I/O operation concerns the data 
portion of a selected RAID volume or configuration 

10 information regarding the RAID volume (s) being 

established on one or more of disk drive devices 40, 42, 
44 and 46 of information handling system 10. As such, at 
122, each intercepted I/O operation is preferably 
evaluated to determine whether the I/O operation concerns 

15 the data portion of the selected RAID volume. If at 122 
the I/O operation being evaluated is determined to 
concern the data portion of the selected RAID volume, 
method 116 preferably proceeds to 124 . Alternatively, if 
at 122 the I/O operation being evaluated is determined 

20 not to concern the data portion of a RAID volume, e.g., 
the I/O operation concerns RAID configuration 
information, method 116 preferably proceeds to 126 where 
the I/O operation is preferably passed to the next driver 
in the driver stack for processing associated therewith, 

25 for example, disk driver 96. 

At 124, the I/O operation concerning the data 
portion of the RAID volume identified at 122 is 
preferably evaluated to determine whether the I/O 
operation is a read operation or a write operation. If 

30 at 124, it is determined that the I/O operation 
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concerning the data portion of the RAID volume is neither 
a read operation nor a write operation, method 116 
preferably proceeds to 126 where the I/O operation may be 
passed to the next driver in the driver stack for 
5 processing associated therewith, for example, disk driver 
96. If, however, it is determined at 124 that the I/O 
operation is either a read or write operation, method 116 
preferably proceeds to 12 8. 

At 12 8, the I/O operation is preferably 

10 distinguished to be either a read operation or a write 

operation. At 128 of FIGURE 4, method 116 may determine 
whether or not the I/O operation is a read operation. If 
at 128 it is determined that the I/O operation is not a 
read operation, i.e., it is a write operation, method 116 

15 preferably proceeds to 130. At 130, the I/O operation is 
acknowledged as a write operation and, according to 
teachings of the present disclosure, instead of 
processing the I/O operation in accordance with normal 
write operation processing and procedures, a good or 

2 0 success status is preferably returned to the write 

requesting application in lieu of actually processing the 
I/O operation. In one embodiment, instead of completing 
the I/O operation by passing it to the next driver, 
"good", "successful" or a similar status is preferably 
25 returned by filter driver 94 to the write requesting 
application . 

Alternatively, if at 128 the I/O operation is 
determined to be a read operation, method 116 preferably 
proceeds to 132 . Instead of actually processing the read 

3 0 I/O operation in accordance with normal I/O processing 
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procedures, filter driver 94, in one embodiment, will 
preferably return a buffer filled zeroes for the 
requested read I/O operation size to the read requesting 
application. In the factory build process, the correct 
5 parity for disk drives in a RAID- 5 build is zero. As 

such, in implementing filter driver 94 in a RAID-5 build 
process, method 116 at 132 in conjunction with the disk 
drive zeroed out verification of method 100 may eliminate 
time allotted to a read operation during the RAID build 

10 process by returning a buffer filled with zeroes for the 
requested I/O operation size. Upon completion of 126, 
130 and 132, method 116 preferably returns to 120 where 
the next I/O operation may be intercepted and evaluated 
generally in accordance with method 116 as described 

15 above. 

Referring now to FIGURE 5, a flow diagram depicting 
a RAID-5 factory build process according to teachings of 
the present disclosure is shown generally at 140. Upon 
initialization at 142, method 140 preferably proceeds to 

20 144. At 144, a filter driver incorporating teachings of 
the present disclosure is preferably loaded onto an 
information handling system 10 desired to be configured 
with a RAID-level-5 . Upon a filter driver loading at 
144, method 14 0 preferably proceeds to 146. 

25 At 146, the RAID-5 build process is preferably 

initialized. Upon initialization of the RAID-5 build 
process at 146, method 140 preferably proceeds to 148. 

At 148, the filter driver loaded at 144 is 
preferably enabled at 148. In one embodiment, methods 

3 0 100 and 116, of FIGURES 3 and 4, respectively, may be 
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implemented generally in conjunction with step 148 of 
method 140. Alternate implementations of methods 100 and 
116 may also be effective. Such alternative 
implementations are considered within the spirit and 
5 scope of the present disclosure. Once the filter driver 
of the present disclosure has been initialized and 
enabled, such as at 148, method 14 0 preferably proceeds 
to 150 and 152 for implementation of a RAID completion 
monitoring loop. 

10 As operations in the RAID build completion 

monitoring loop, the status of the RAID build is 
preferably monitored at 150. At 152, a determination is 
preferably made as to whether the RAID initialization has 
been completed- If at 152 it is determined that the RAID 

15 build/initialization process has not completed, method 

140 preferably returns to 150 where continued monitoring 
of RAID initialization status may be checked. 
Alternatively, if at 152 it is determined that the RAID 
build/initialization process has been completed, method 

20 140 preferably proceeds to 154 where it may end. In an 
alternate embodiment, prior to method 14 0 ending at 154, 
the filter driver of the present disclosure may be 
removed from information handling system 10. For 
example, the filter driver loaded 144 of method 14 0 may 

25 be deleted prior to method 140 ending at 154. 

Alternative and/or substitute operations may be 
incorporated into methods 100, 116 and 140 without 
departing from the spirit and scope of teachings of the 
present disclosure . 
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Although the disclosed embodiments have been 
described in detail, it should be understood that various 
changes, substitutions and alterations can be made to the 
embodiments without departing from their spirit and 
5 scope . 
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